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Preface 

The  science  of  statistics  can  bo  of  groat  uso  to  goodosists  and 
photogrammetrists  in  reducing  and  analyzing  data  obtained  from  meas- 
urements.  This  thesis  will  explore  the  applications  of  statistical 
hypothesis  tests  in  the  fields  of  geodesy. 

The  data  for  examples  have  been  taken  from  various  studies  in 
the  field  of  geodetic  science.  In  each  case  the  source  of  the  data 
is  noted.   Often  the  conclusions  drawn  by  statistical  tests  will  not 
agree  with  the  conclusions  drawn  by  the  original  experimenter.   It 
is  not  the  intent  of  this  thesis  to  criticize  the  work  of  others, 
but  only  to  give  examples  of  how  statistical  tests  may  be  used  to  guide 
geodesists  in  drawing  conclusions  from  observed  data. 

The  author  gratefully  acknowledges  the  guidance  of  Dr.  Urho  A. 
Uotila  and  the  assistance  and  inspiration  of  his  wife,  Judy,  in  the 
preparation  of  this  thesis. 
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1 

CHAPTRt  1 

INTRODUCTION 

In  geodesy  and  photocrammetry,  large  quantities  of  data  are 
coJiontod  jn  the  torn  of  moapurronents  such  as  angles,  lengths,  and 
gravity  values.  It  is  imperative  that  the  relevant  information  con- 
tained in  a  mass  of  geodetic  data  bo  oxpressed  by  comparatively  few 
values.   To  accomplish  this  task,  geodesists,  as  scientists  in  other 
fields  of  physical  and  social  science,  have  found  many  solutions  through 
the  science  of  statistics. 

Statistics  is  the  study  of  populations,  variances,  and  methods 
of  data  reduction.  Some  statistical  methods,  notably  the  method  of 
least  squares,  have  found  universal  acceptance  in  geodosy.  Other 
statistical  methods  are  not  so  widely  used.  It  is  the  purpose  of  this 
thesis  to  explore  the  applications  of  statistical  hypothesis  testing  to 
the  problems  of  geodesy, 

1.1  Statistical  Theory 

A  STATISTIC  is  a  value  calculated  from  an  observed  sample  with 
a  view  to  characterizing  the  population  from  which  it  is  drawn. 
Statistics  which  will  be  of  value  to  geodesy  are: 
Population  mean         ^ 
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Sample  mean  X   =      &■ - • 
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True  variance  5  * 
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Variance  estimate1 

Fisher's  F  Statistic 

Pearsen's'X  2  Statistic 

Student's  t  Statistic 

The  standard  error 

The  estimate  of  standard  error    s 
A  statistic  which,  on  the  average,  gives  the  right  answer  is 
said  to  be  unbiased.   If  a  statistic  gives  values  which  are  concen- 
trated more  closely  to  the  right  value,  the  statistic  is  said  to  be 
efficient, 

A  widely  used  concept  in  statistical  theory  is  a  random 
variable.  A  random  variable  is  a  quantity  which  takes  on  a  definite 

value  at  every  point  of  a  sample  space.  Geodetic  measurements  are  con- 

2 
side red  to  be  random  variables, 

1.2  Density  and  Distribution  Functions 

Assume  that  a  sample  space  is  such  that  each  point  of  the  sample 

space  can  be  characterized  by  the  value  of  a  continuous  variable,  x, 

which  can  take  on  all  values  between  -co   and  +oo   .   If  the  event,  A  is 

defined  as  the  set  of  all  sample  points  characterized  by  the  inequality 

x  ^  x° 

where  x°  is  some  fixed  value.   The  cumulative  probability  distribution 

function,  F(x°) ,  and  the  probability  density  function,  ^(x)  ,  are  de- 

J-The  letter  m  is  commonly  used  in  geodetic  literature.   In  most  statis- 
tical  literature  Greek  letters  represent  true  values  and  the  Roman  equiva* 
lent  represents  the  statistical  estimate  of  this  value.   For  ease  of 
understanding,  the  statistical  convention  will  be  followed. 

2See  discussion  by  J.L.  Steam  (1964). 


fined  by  Q 

x 

P(A)  -  F(xO)  -    //(x)dx. 

—  CO 

The  density  function,  pXx)  ,  has  the  additional  significance  that 

P(x°£;  x  £  x°  +  dx)  «j0C(x°)dx. 
Since  the  total  probability  for  a  sample  space  must  equal  unity,  j3(x) 
must  include  a  normalization  factor 

F(co)  -  J   ^(x)dx  -  1. 
The  basic  requirement  for  any  probability  density  function  is 
that  this  integral  exists.  A  further  requirement  is  that 

J&x)  ^  0  for  -  <°  <  x  <  +  co  , 
There  are  many  distributions  and  density  functions.  As  an 
example  the  uniform  density  function  is  defined 

jtf(x)   -  1_  for  -  a -=  x^z   +  a 

2a 

and 

0{x)   «  0  otherwise. 
Figure  1  shows  the  corresponding  probability  density  function, 
Mx),  and  the  cumulative  probability  distribution  function,  F(x),  for 
this  uniform  density  function. 
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Figure  1 


A  quantity  is   said  to  bo  normally  distributed  whon  it  takes  all 
values  from  -<r>    to  +<x> ,   with   frequencies  given  by  a  definite  mathe- 
matical law,   namoly,    the  logarithm  of  the  frequency  at  any  distance,   d, 
from  the  center  of  the  distribution  is  less  than  the  logarithm  of  the 
frequency  at  the  oentor  by  a  quantl  ty  proportional  to  d    .      The  distri- 
bution is  symmetric  with  greatest  frequency  at  the  center  (Fisher,    1925)* 
The  density  function  of  a  normal  distribution  of  moan  m  and  variance  s-a 
is  given  by  the  expression 


r  6  -Vt 


ULP^- 


Figure  2  shows  the  probability  density  function  and  the  cumu- 
lative probability  distribution  function  for  the  normal  distribution. 
The  scale  of  x  can  be  changed  by  measuring  each  x  value  by  its  distance 
from  the  mean,  and  adopting  the  standard  deviation  6    as  a  unit  of  meas- 
urement. An  ordinate  of  this  normal  curve  is  then 

W    =  (*  ~^> 
The  quantity  w  is  called  the  standard  normal  deviate  (Handel,  1964). 
A  unit  normal  deviate  is  a  distribution  which  has  a  moan  of  0,  and  a 
variance  of  unity. 
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Figure  2 
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The  normal  distribution  is  very  useful  In  stntistics  because  of 
the  Central  Limit  Theorem.   This  theorem  may  be  expressed  as  follows: 

Given  a  population  of  values  with  a  finite  variance,  if  inde- 
pendent samples  are  taken  from  this  population,  all  of  size  N,  then  the 
population  formed  by  the  averages  of  these  samples  will  tend  to  have  a 
normal  distribution,  regardless  of  what  the  distribution  of  the  origi- 
nal population  is;  the  larger  N,  the  greater  will  be  this  tendency 
towards  normality. 

Thus  far  only  functions  of  one  variable  have  been  discussed. 
If  there  are  more  than  one  variable  associated  with  each  point  in  sample 
space,  multivariant  functions  may  be  defined.   For  example,  a  multi- 
variant  probability  density  function  is  defined  as 

;ZfUl°,  x2° xn°)dx1dx2 dxn 

-  PUj0^  xt  ^  xj0  +  dxit  i  -  1,  2,..n). 
The  right  side  of  this  equation  may  be  interpreted  as  the  probability 
that  all  the  inequalities  hold  simultaneously.   For  an  excellent  discus- 
sion, the  reader  is  referred  to  pages  16  through  18  of  Hamilton  (1964). 

1.3  Statistical  Hypotheses 

Webster  defines  a  hypothesis  as  a  tentative  theory  or  suppo- 
sition provisionally  adopted  to  explain  certain  facts  and  to  guide  in 
the  investigation  of  others  (Webster,  1954).  A  statistical  hypothesis 
is  thus  a  theory  about  some  population. 

The  only  way  that  one  can  be  absolutely  certain  of  the  truth  or 
falsity  of  a  statistical  hypothesis  is  to  examine  the  entire  population. 
Since  measurements  can  take  on  an  infinite  number  of  values,  exami- 


nation  of  tho  entire  population  in  g'odctic  applications  is  impossible. 

One  is  then  forced  to  make  a  decision  based  on  a  fow  measurements. 

Statistically  speaking  thene  measurements  are  a  sample  taken  from  the 

population.   The  procoss  of  using  this  sample  to  test  tho  truth  or 

falsity  of  a  hypothnnin  is  enlled  statistical  tests.   There  is  in  these 

tests  no  certainty  that  a  mistake  has  not  been  made.   There  arc,  in  fact, 

two  different  kinds  of  orrors  which  can  be  made.   These  are  called: 

Type  I  (c*  )  error  —  the  rejection  of  a  hypothesis 

which  is  true 

Type  II  (/&)   error  —  the  acceptance  of  a  hypothesis 

which  is  falso 

These  errors  will  be  discussed  in  detail  later. 


Table  1 

Definition  of  tho  Types  of  Errors  Associated 
with  the  Tests  of  Statistical  Hypothesos 


Decision 


True  Situation 


Hypothesis  is  Truo 


Hypothesis  is  False 


Accept  the 

Hypothesis 


No  Error 


T/po  II  Error 


Reject  the 
Hypothesis 


Type  I  Error 


No  Error 
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CHAPTER  2 

HYPOTHESIS  TESTING 

It  may  be  of  value  to  know  if  data  are  normally  distributed. 
A  simple  test  is  to  compare  tho  histogram  to  a  normal  curve  (Dixon, 
1957).  The  percentage  of  the  data  in  a  given  group  of  the  histogram 
can  be  compared  to  the  area  under  the  normal  curve  corresponding  to  the 
given  group. 

For  example;  givon  the  mean  of  a  data  sot  as  30»  and  its  stand- 
ard deviation  of  5,  80$  of  tho  data  is  found  between  20  and  35.  Is 
this  data  normally  distributed? 

The  standard  normal  deviates  for  tho  data  group  are 

X20  a  20-30  =  -2.0 
X35  =  35-30  =  1.0. 
From  Appendix  I  the  area  under  the  normal  curve  is 

from  ^00  to  1.0  =  ,8413 

from  -00  to  -2   ■  1.0  -  .9772  =  .0228 

The  area  under  the  normal  curve  =  .8185 

or  81. 85$.  This  would  indicate  that  the  data  in  this  group  has  a  distri- 
bution close  to  a  normal  distribution. 

The  %  statistic,  discussed  in  section  2.^,  may  also  be  used  to 
test  the  distribution  of  a  group  of  observations.  A  discussion  of  this 
test  will  be  deferred  until  the  statistic  has  been  introduced. 

For  an  example  of  more  elaborate  tests  for  normal  distribution 
the  reader  is  referred  to  the  paper  by  Stearn  (196*0. 


2.1  Tests  Involving  the  Normal  Distribution 


If  a  single  observation,  x,  is  made  from  a  normally  distributed 

2 
population  of  mean,  uy   and  variance, 6  ,  statistical  theory  states 

that  (Hamilton,  1964) 

<!>  w  "  (*  'M) 

has  a  normal  distribution.   The  probability  that  the  magnitude  of  w 

calculated  in  this  manner  exceeds  some  specified  value 

(2)  P(|w/  >  wr  ) 

where  w^.  is   the  value  of  w  to   the   right  of  which  lies  an  area  r  under 
the  probability  density  curve  is 

(3)  P(|w|  >  w,)   -/. 
We  can  then  write 


(4)     P(|w/  >    w.)  -       I 


V2- 


I 


-~W*       w 


+  CD 


dw  + 


f   e  "  *   dw  I  . 


From  Figure  3  we  can  interpret  the  value  of  the  integral  as  the  cross 
hatched  area  under  the  curve. 


Figure  3 
Mathematically  Hamilton  (1964),  shows 


(5) 


P(/w|  >  w,)  -  2F(  -  wy) 

-  2(1  -  F(W/)) 

-  2  IT 


The  value  of  F(w^)  can  be  found  from  a  table  of  the  cumulative  normal 
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distribution  function  (Appendix  I). 

If  wff  is  1.96 

P(|w|  p-  1.96)   -  2(1   -  0.975)   -  0.05. 
liquation  (5)   can  be  expressed  as 

(6)  P(/x  -m\  >   w^.)   -  20   -   F(wy)} 
or 

(7)  PC/f-ffw^    x^  +CTWP   -  1    -   2fl   -   F(w^ 

-  2F(w,)  -  1. 
For  the  example 

P(A-  1.96e>^  x^/<+  1.96<r)  -  0.95 
that  is,  the  probability  that  a  single  observation,  in  the  normal  popu- 
lation given,  will  lie  within  1.96So£  the  mean,  is  95%. 

The  inequality  expressed  in  (6)  forms  the  confidence  interval 
which  is  the  basis  for  the  test  of  a  hypothesis  involving  the  mean  of  a 
normal  population. 

Symbolically  this  null  hypothesis,  H0,  can  be  expressed 

(8)  H0  :  A  -4o 
and  the  alternative 

The  unit  normal  deviate  is  calculated  from  (1),  if  |  w(  >  1.96 
the  hypothesis,  H0,  can  be  rejected.  The  risk  of  rejecting  a  true  HQ 
(TYPE  I  ERROR),  or  the  level  of  significance  (c*  )  is  .05.   It  should  be 
noted  that  any  other  level  of  significance  could  be  selected  simply  by 
selecting  a  different  value  for  w^  from  the  table.  As  an  example,  if 
w  were  1. 64, «. would  be  .10. 

The  area^T,  shown  in  Figure  3,  varies  directly  ascx,.   The 


10 
shaded  area,   2t    is  in  this  case  equal  to«     .      This  region  is  known  as 
the  critical  region  (Ostle,   I963).     If  the  value  of  the  test  statistic 
used  in  a  particular  test  of  a  statistical  hypothesis  falls  in  this 
region,   the  hypothesis  is  rejected. 

If  CKxZif  ,   the  test  is  said  to  be  two  tailed;   that  is,    the 
hypothesis  was  rejected  either  if 

w  >  1.96 
or 

w<-1.96. 
The  test  can  also  be  formulated  such  that  the  hypothesis  is  re- 
jected only  if 

w  ;>  I.96 
or  if 

w<  -1.96. 
In  each  of  these  cases  the  probability  of  a  Type  I  error  (rejection  of 
a  true  hypothesis)  is  the  area  under  only  one  tail  of  the  curve  and 
(9)  cK  =    r 

Figure  (4)  shows  the  critical  region  for  the  one-tailed  test. 


/.96 


Figure  k 
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A  test  of  a  statistical  hypothesis  basod  on  either  extreme  of 
the  distribution  is  a  one  tailed  test.     It  is  interesting  to  note  that 
for  any  symmetrical  distribution  a  ono  tailed  tost  at  100(cx  )'^  signif- 
icance level  is  equivalent  to  a  two  tailed  test  at  100(2<*)$  signifi- 
cance level.      This  is   true  since  the  same  critical  value  of  w  applies 
to  the  two  cases. 

2.11   Test  Procedure 

In  order  to  test  the  hypothesis 


H0  ''^Mo  *s  ^M*e) 


against  the  alternative 


Hl    :«=*o 


o 

At  a  significance  leveled,    the  following  steps  would  bo  followed: 

(1)  Select  a  lovel  of  significance  oc ,   and  find  the  corre- 
sponding Wy  from  the  appropriate  table. 

(2)  Compute  the  unit  normal  deviate 

(3)  Reject  the  hypothesis  if   |w|>wy  . 

For  example,   a  set  of  observations  has  a  sample  mean  X  of  1»50»   and  a 

variance,  6Z,   of  0.25.      Could  tho  true  mean,/c    ,   be  2.00?       The  hypothe- 

sis  to  be  tested  is 

HQ    l/fc-  2.00 

If  «x  is  selected  to  be  0.05»    the  tabulated  value  of  wg  is   1.9&. 

w  =  1.50  -  2.00  s^s-1 
.5  .5 

1  4  1.96 

Therefore,  the  hypothesis  can  be  accepted  at  the  level  of  significance 
selected. 
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The  testing  of  a  hypothesis  such  as  this  has  very  little 

2 
practical  significance  in  geodesy,  since  the  variance,  <S*  ,  is  seldom 

known. 

This  test  has  been  discussed  in  great  detail  because  the  prin- 
ciples involved  apply  to  all  statistical  tests,  regardless  of  the 
statistic,  or  distribution  used. 

2.2  Testing  with  a  Sample  Mean 

The  sample  mean  of  a  set  of  observations  is  defined  as 

n 

X'il  *i 
n  i-1 

where  n  is  the  number  of  observations. 

Statistical  theory  states  that  the  sum  of  normally  distributed 
random  variables  is  normally  distributed  (Hamilton,  1964). 
The  density  function  of  X  is 


6  (2  XT  J  2G2- 


The  standardized  variable  is  then 

w  -  (X  -M)^ 
Following  the  same  method  used  for  an  individual  observation,  the  proba- 
bility that  the  deviation  of  the  sample  mean  exceeds  a  specific  value  is 

P(/(-,gVr/X^^rwy)  -  2F(W/)  -1 
n*  n'2 

To  test  a  hypothesis  from  a  sample  of  n  observations  and  a  sample  mean 
of  X, 

against 

Hi    \M\Mo 
compute 

w  -   a  -/On* 


L3 

The  hypothesis  is  rejected  at  the  100W7.  level  if  M  >  w^. 

With  this  test  a  confidence  interval  is  established  around  X. 
The  probability  that  the  true  mean,,y,  lies  within  this  interval  is 

n  '*•  v\  '<- 

This  test  would  be  useful  to  determine  if  a  new  set  of  obser- 
vations is  part  of  an  established  population  with  a  mean /^ ,  and  a 
standard  error  of  6". 

It  has  been  shown  that  the  probability  of  a  Type  I  error  is 
the  level  of  significance  chosen  for  the  test.   There  is  also  the 
possibility  of  committing  a  Type  II  error,  that  is  the  error  of  accept- 
ing a  false  hypothesis. 

The  probability,/?,  of  a  Type  II  error  is  dependent  upon  the 
specific  alternative  hypothesis  which  is  presumed  to  be  true 

The  probability  of  a  Type  II  error  is 

(10)  -  PC-w^^w.*  w^), 


when  H.    is  true.      From  (1) 


w  -   (X  -yq^ryh, 


If  the  alternate  hypothesis  is  true,  w.  is  distributed  as  the 
unit  normal  deviate,  and 

(U)  Wl  "  n^(*  'Mq  +/<o  -AQ 

wl   "  wo  +    {#o   ->?])*■*• 
<T 
This  can  be  written  as 

(12)  wQ  -  Wl  +   QyL   -^n)n3s. 
Equation  (10)   can  then  be  written  as 

(13)  fi    -  P(      -vya<w1  +   (^L   -MQ)nh  ^v^) 

0~ 
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(14)  /#-  P(-wI</a  -/^.  -MA   »'^w,(w^  -  (ilL^Laiill) 
It  should  be  noted  that  as  (M0- M.  )  becomes  smaller  the  proba- 
bility of  not  rejecting  a  hypothesis  when  it  is  false  is  nearly  as  great 
as  not  rejecting  it  if  it  is  true.   The  power  of  a  statistical  test  is 

defined  ad  1  -/3,      The  power  of  a  test  increases  as  /./p  -/!,_   becomes 

<5~ 
larger  or,  as  the  number  of  observations  iacreases  (Hamilton,  1964). 

Figure  5  shows  the  power  of  the  test  for  <x  -  .05  and  .01,  as  a  function 

of  cf  where  <f  -  (M\   -Mri)ni. 


-S.o 


S.o 


Figure  5 
Note  that  1  -  ft  varies  directly  as  oc.   The  value  of  j3  is 
determined  by  the  value  chosen  for  crt.   If  the  critical  region  is  in- 
creased, /3  decreases.   Concerning  this  problem  Graybill  (1961)  states, 

"We  should  like  to  minimize  the  likelyhood,  or  proba- 
bility, of  making  either  of  these  two  errors.   However, 
in  general,  for  a  fixed  number  of  observations,  if  we 
decrease  the  probability  of  making  an  error  of  one  type, 
we  increase  the  chances  of  making  the  other." 

Power  curves  can  be  found  in  Table  A-12  of  Dixon  (1957). 


16 

Using 

the  t  di 

stri button 

tho  tost 

H 

0 

:*  =  ^o 

against 

Hi 

»A*/f0 

can  bo  carried 

out. 

First  compute 

t  =  (I 

-M0W  = 

(X 

-Mo>  • 

sa 

:x 

Reject  the  hypothesis  at  the  en    levol  of  significance  if 

|  tl  >   tn-l,*  . 
This  test  is  valuable  to  the  geodesist  attempting  to  deter- 
mine if  a  new  set  of  observations  is  from  the  same  population  as 

—  2 

previous  observations  with  a  mean  of  X  and  a  variance  estimate  s  , 

This  problem  often  arises  in  the  weighting  of  observations  in  an  adjust- 
ment. 

The  following  problem  will  serve  as  an  example. 

In  his  evaluation  of  the  Laser  Theodolite,  Dunn  (1966) 
observed  the  horizontal  angles  between  two  targets  at  various  current 
levels.  At  35  /4   amps  the  angles  obtained  were  (Dunn,  p  52): 
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Left 
3°     17'   22". 2 

3° 

Right 
06'   29". 8 

3° 

Moan 
06'  56". 0 

Residual 
-5.0 

07 

17  .5 

06 

36  .8 

06 

57  .1 

-3.9 

07 

00  .3 

06 

^3  .6 

06 

52  .0 

-9.0 

08 

29  .2 

06 

39  .8 

07 

3^  .5 

33.5 

07 

18  .4 

06 

33  .8 

06 

56  .1 

-4.9 

07 

13  .1 

06 

27  .8 

06 

50  A 

-10.6 

mean 
3°     07' 

26".  8 

3° 

06' 

35".  3 

3° 

07' 

01".  0 

™j  =  1379.83 

£vvj    - 
n-l 

s2  =  276.0 

so 

-  ±  16" 

.5 

The  t  test  may  be  used  to  compare  the  mean  of  one  set  with  a 
grand  mean  of  previous  sets. 

From  the  previous  two  sots  of  observations  the  moan  is 
20    /ramps  3°  06'   51".  1 

30    >yaraps  3°  06'    52".  1 

mean  of  1  &  2     3°  06'   51". 6 
Test  the  hypothesis 


H0   :  M  35  =  30  06'   51". 6 


against 


Hx   :  M  35  $  3°  06'  51". 6 

t  ■  (3°  07'  01". 0  -  3°  06'  51". 6)  (  -/"6) 

t  "  9". 4  (2.45)  =  0.990 
163 

Since  t  <  t^     t$,   this  hypothesis  can  be  accepted  at  the  5$  signifi- 
cance level. 


18 
2.4  Tests  Involving  Variance 

The  statistic  °)C      was  introduced  by  Po.irson.      It  is  defined  by  the 
density  function 

(,7)    ^2)'  -l^¥(%)  e'^W**-1 

for      *X2  >  O  otherwise    <fi  C%Z)~  O  . 

It  is  also  known  that  the  density  function  for  the  variance  estimate 
s^  from  a  sample  size  n  of  a  population  with  a  true  variance  B  is 
(Hamilton,   1964)  v 

for      s*   >0  otherwise     $(s*)  =  & . 

Setting  TC  -  ^s*     these  two  density  functions  are  identical.      The 
value  of  V-^l  is  distributed  as**!2. 
The  value  of  *X*such  that 

are  tabulated  in  Appendix  III. 

TheTC     statistic  can  be  used  to  test  whether  the  true  variance 
estimated  by  s2  is  equal  to  some  variance   6£2.     The  hypothesis  can  be 
stated 

H0   i     62  =  <ro2 

To  test,  compute       °>C     =    — ^y 


^o 


The  hypothesis  is  rejected  if 


or 


*X_      <    °X    ^^  t-cx/2 


X.    >    xL  ^)W/2 
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The  uso  of  this  statistic  to  tost  hypothosis  concerning 
goodetic  and  photogrAmmotric  data  is  very  limited.   To  computo 
tho  statistic,  Gx  the  truo  variance  of  the  population  must  bo  known. 
This  quantity  is  raroly  known  for  tho  types  of  data  analyzed  in  geodesy, 

One  application  of  the  Chi-square  test  would  be  in  trian- 
gulation.  The  square  of  the  dosirod  standard  error  of  the  not  could 
be  considered  as  tho  true  variance,  6"  .   Tho  Chi-squaro  statistic  com- 
puted at  each  station  in  the  net,  at  the  time  of  observation,  could 
then  be  used  as  a  critorion  for  acceptance  or  rejection  of  the  obser- 
vations. 

The  "X    statistic  can  bo  used  as  a  test  of  distribution.  To 
test  the  distribution,  observations  are  grouped  into  n  groups  accord- 
ing to  their  values.  The  expected  number  of  observations  in  each 
group  is  computed  on  the  basis  of  the  assumed  distribution.   Let  n^  be 
the  number  of  observations  actually  found  in  the  kth  group  and  d^  be 
the  number  predicted  by  the  assumed  distribution.   Tho  statistic 

is  distributed  approximately  as^.„if  each  d^  is  at  least  5  (Hamilton, 

196*0. 

2.5  The  F  Statistic 

The  F  statistic  or  variance  ratio,   first  introduced  by  Fisher, 
is  a  useful  statistic. 

F  is  defined 

(18)  F^iv2  "  UlhQ/iY7/*2) 

where  Yj[  is  distributed  as'X'with  v>  degrees  of  freedom,   and  Y2," 

INDEPENDENT  of  Yi,   is  distributed  as^with  ^degrees  of  freedom. 


20 
The  probability  density  function  a.';  given  as 
0(F),     T(rV?,    +VJ/2)      /g\X  rr^)(  ,+.\3A"-L^-e 
for  F  ;>  0  otherwise       0(F)  =0. 

It  was   shown  previously  that 

2 

For  two  samples  from  two  normal  populations  s^    is  distributed  as 


z 


Thus  the  hypothesis  that  the  samplos  were  drawn  from  populations  with 
identical  variances 


can  be  tested  against 


compute 


"o  >®i2=«22 


%  =6i2  «f^2 


IP        *#    * 

F  =  Sl  /s2   . 
The  null  hypothesis  is  rejected  if  the  computed  statistic  falls 
in  the  selected  critical  rogion  at  either  end  of  the  distribution 

F  ^F*l,  v?2,  1  -«72  or 

F  ^  F\ylf  \72>  0^/2        (Dixon,  1957). 
The  inequalities  shown  above  depend  upon  tho  definition  of  ot.   , 
In  some  literature  they  will  appear  reversed. 

Tables  of  the  percentiles  of  the  F  distribution  can  be  found 
in  handbooks  and  statistics  texts  such  as  Dixon  and  Massey,  Table  A-7c 
Appendix  IV  shows  a  sample  of  such  a  table. 

As  an  example  of  the  F  test,  the  data  takon  by  Dunn  (1966)  and 
Sprineky  can  be  analyzed. 
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Lt,  Dunn  made  Jk   observations  of  a  horizontal  angle  with  tho 
Laser  Theodolite. 


The  values  were 


06 •  56". 1 

06  44  .7 

06  50  .4 

06  51  .6 

06  47  .2 

06  50  .6 

06  56   .2 

06  ^9  .0 

06  53  .9 

06  51  .1 

06  5^  .6 

06  47  .7 

06  40  .5 

06  37  .7 

06  40  .2 

06  45  .5 

06  45  .8 

06  49  .7 

05  56  .0 

06  57  .1 
06  52  .0 
06  56  .1 
06  50  .4 
06  35  .0 

06  22  .4 

07  00  .1 

06  58  .4 

07  04  ,3 
06  45  .3 

06  19  .6 

07  00  .7 

06  34  .8 

07  17  .6 
06  18  .2 


Mean  of  34  observations  3°  06»  48". 3 
Standard  deviation  +  12". 4 

Sample  variance  153.8 
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Using  tho  samo  equipment  Capt.  Sprinsky  obtained  the  following 

5°  27'  19". 9 

26  58  .4 

27  00  .0 
26  53  .** 
26  5^  .3 

26  57  .1 

27  13  .7 
27  22  .6 
26  53  .2 

26  41  .0 

27  07  .1 
27  13  .2 
26  48  .1 
26  41  .4 
26  30  .7 
26  44  .5 
26  49  .5 
26  45  .0 
26  52  .2 
26  33  .2 
26  38  .8 
26  39  .2 

.  26  38  .0 

26  28  .1 

26  40  .2 

26  29  .1 

27  10  .7 
26  42  .7 

Mean  of  28  observations  5°  26 •  50". 37 

Standard  deviation  +  14". 3 

Sample  variance  204.5 

Were  the  true  variances  of  these  two  sets  of  observations  equal? 

As  a  hypothesis  this  question  can  be  stated 

H0   :<S-X  -<T2 

and  can  be  tested  against  the  alternative 

H  i  :  ert  *t  <o2 

The  test  statistic  is  computed 

F  =  204. 5  »  1.33 
153.8 
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From  a  table  of  percentiles  of  F  distribution,    such  as  tho  sample  in 
Appendix  IV,   the  values  of  F  are  found  to  be 

F28,  35.  2.5k  ■  -^ 

F28,  35.  97.5*  =  2-01   - 
Since 

F  >F28,   35,  2.5$ 
and 

F^F28,  35.  97.5%    . 
•we  can  accept  the  hypothesis  at  the  5$  significance  lovel.     It  can  then 
be  concluded  that  the  observations  of  Lt.   Dunn  and  Capt,   Sprinsky  have 
the  same  true  variance. 

The  following  example  will  demonstrate  another  use  of  the  F 
test. 

In  his  study  of  the  Kern  DKM  3  thoodolite  Abby  (I965)  measured 
angles  with  a  Wild  T-3.  and  tho  DKM  3  using  the  center  wire  and  the  5 
wire  field.   These  results  wore  obtained: 
Instrument    Observations 


1 

T-3 

16 

2 

T-3 

16 

3 

DKM  3 

15 

4 

DKM  3 

10 

5 

DKM  3 

10 

6 

DKM  3 

30 

res 

s 

s 

1 

0".99 

0.980 

1 

.78 

.608 

5 

M 

.221 

5 

M 

.19^ 

1 

.66 

.436 

5 

.61 

.372 

The  F  statistic  can  bo  used  to  assist  in  drawing  conclusions; 
(a)  Are  the  5  wire  observations  significantly  botter  than  the 
1  wire  observations  with  the  DKM  3?  The  hypothesis  to  be  tested  is 


2h 

H0  :  G^  *e22 

For  this  example  a  one- tailed  tost  will  bo  used.   Tho  null  hypothesis 

can  bo  rejected  if 

Sj2v/Tl  -«*)  (ni  -  1)  (n2  -  1). 
s2 


Comparing  sets  3  and  5 


F  =  0.^36  ■  1.973 
0.221 


F(14,  9)  5£  =  3.00 
The  difference  is  not  significant  at  the  5$  level. 


Using  sets  4  and  5 


F  ■  0.436  =  2.247 


F(9.  9)  5^=  3.18 
The  difference  is  not  significant  at  the  5$  level. 


And  using  sets  5  and  6 


F  =  0.436  =  1.172 
0.372 


F(15.   29)  5$  =  2.03. 
Again  the  difference  is  not  significant  at  the  5$  level. 

(b)  Are  the  5  wire,   30  observation  sets  significantly  better 

than  the  5  wire  10  observation  sets? 

F  =  0.372  =  1.918 
0.19^ 

F(29.  9)  5$  =  2.86 

The  difference  is  not  significant. 

(c)  Is  the  DKM  3  significantly  better  than  set  1  with  the 
T-3? 


25 

F  =  .980  =  2.2^-7 
73*55 

FU5.  15)  5*  *2M 
Again  tho  test  shows  that  tho  difference  is  not  significant. 

(d)  Is  the  DKM  3»  5  wire  procedure  significantly  better  than 
the  T-3? 

F  =  0.980  =  hM     and     F  =  .608  =  2.7^ 
0.221  .221 

F(15,  14)  5^  ■  2-^ 
The  test  shows  that  the  DKM  3»  5  wire  procedure  is  significantly  better 
than  the  T-3. 

This  problem  also  shows  that  statistical  tests  are  not  in- 
fallible. From  parts  (a)  and  (b)  one  can  conclude  that  the  dif- 
ference in  procedure  is  not  significant.  Part  (c)  draws  the  con- 
clusion that  the  1  wire  DKM  3  procedure  and  the  T-3  are  not  sig- 
nificantly different,  yet  (d)  concludes  that  the  5  wire  procedure 
is  significantly  different. 

Statistical  inference  must  be  used  with  caution.  Judgement 
must  also  be  used  in  drawing  conclusions. 

2.6  Testing  Correlations 

The  distribution  of  the  sample  correlation  coefficient,  r, 
is  very  complicated.  Often  in  geodesy  it  is  valuable  to  know  if 
the  true  correlation  coefficient,  p  ,  is  zero.  If  p   is  equal  to 
zero  the  statistic 


r  Vn  - 
-/l  -  r 


-  2 

7 
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is  distributed  as  Student's  t  with  (n-2)  degrees  of  freedom  (Guttman, 
1965). 


To  test  the  hypothesis 


against 


compute  the  statistic 


Ho  :/  =  0 
Hi  : f  ±   0 


C  =    r  Vn-2 
Vl-r^ 

If  C  ,>  tn-2  <=</2  "^H®  hypothesis  may  be  rejected  in  favor  of  the 

alternate. 

To  test  the  hypothesis 

H0  s  $>  =  f,  f   where  ff  £  0 

we  must  know  the  distribution  of  -P  when  -P  £   0.  In  most  geodetic 

applications  it  is  more  important  to  know  if  correlation  exists, 

rather  than  if  it  might  be  some  specific  value.  For  a  discussion 

of  the  distribution  of  f  £   0  the  reader  is  referred  to  Graybill  (I96I). 
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CHAPTITR  3 

TESTS  INVOLVING  MULTIPLE  VARIABLES 
So  far  the  tests  which  have  been  discussed  deal  only  with  one 
variable      The  techniques  of  Regression  Analysis  allow  the  testing 
of  hypothesis  involving  two  or  more  related  variables. 

3.1  Regression  Analysis 

The  functional  relationship  between  some  observable  L  and 
variables  can  bo  expressed  mathematically 

(19)  L  =  f(Xi,  X2 Xn  /&,  >&2 &,) 

S>£-  is  a  parameter  in  the  function.  This  equation  is  often  abbre- 
viated 

L=  f(Xl,  X2 Xn). 

To  the  statistician  this  equation  is  known  as  a  regression 
function.  The  geodesist  and  photogrammetrist  will  recognize  it  as 
the  mathematical  structure  for  a  method  of  observation  equations 
adjustment  problem. 

The  mathematical  structure  of  the  problem  may  bo  chosen 
by  two  methods.  In  geodesy  the  analytical  consideration  of  the 
phenomenon  involved  is  the  preferred  method.  The  examination  of 
scatter  diagrams  plotted  from  the  observed  data  can  also  yield  a 
workable  structure. 

3.2  Fitting  a  Linear  Mathematical  Structure  by  Least  Squares 

Suppose  that  a  linear  relationship  exists  between  a  dependent 
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variable,  y,  and  an  independent  variable,  x,  such  a3  tho  relation- 
ship botwoen  gravity  and  height  in  froe  air.  From  observations  n 

sets  of  points,  (xj.yj),  (x2,  y2)  (*n»  yn)  a**e  obtained.   The 

sample  structure  can  be  expressed 

y  =y?0  y?,  x 

From  tho  observed  data  we  wish  to  obtain  the  values  of  the  unknown 

Let  us  assume  that  y  for  any  given  value  of  x  is  a  random  variable 
which  is  normally  distributed  with  a  mean  /j0^y/?i  x     and  variance    6~2  , 
We  will  also  assume  that  /%,  and  /3,  do  not  depend  upon  x     .      The  con- 
ditional probability  density  function  of  y  for  a  given  x  can  be  written 

(20)  f(y/x)  =       1        expf-1  (y  -/f-^x)2/. 
The  expected  value  of  y,      E(y/x)  is 

(21)  ^y/x)=/0^x, 

The  conditional  probability  density  function^  of  a  sot  of  n  obser- 
vations (x,y),  (x2,  y2) (x^  y  )  is 


(22)  f,H*M*t  I**  e*r[-^A*-A-A*0} 


Applying  the  principles  of  maximum  likelyhood  (Guttman,  1965)  the 

parameters  which  maximize  the  probability  density  function  must  be 

found.   To  maximize  this  function  the  quantity 

,2 


(23)        s  -  £  (Vi  -/?o  -^x.) 


For  proof  see  Guttman  (1965)i  Appendix  III 
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must  be  minimized  with  respect  to  /&„  and/3,  .      The  process  of  esti- 
mating B0  and  &x    by  minimizing  the  sum  of  the  squares  of  the  rosiduals 
is   the  well  known  method   of  least  rouarcs.      The  value  of  /oo  and    /3 
which  minimize  F(^  fi)  are  thoso  for  which 
(2k)  AflSiAJ    =o 

dFtfsJJ    =o. 

Z  A 

It  should  bo  noted  that  for  a  normal  distribution,  the  method  of 
least  squares  gives  a  maximum  likelyhood  estimate  of  the  parameters. 
If  the  error  in  the  observed  quantities  is  not  normally  distributed, 
the  method  of  least  squares  will  give  a  minimum  variance  estimate  of  the 
parameters  but  this  estimate  will  not  be  the  maximum  likelyhood  estimate. 

Taking  the  derivitives  of  (23)  we  obtain 

(25)        -2  f  (y4  -/?0  -tflH)   -  0 

These  equations  can  be  rewritten  as 

n 


In  this  form  they  are  referred  to  as  the  normal  equations.  In  matrix 

form  this  equation  is  the  formula 

NX  +  U  =  0. 

If  the  determinant  of  N  is  not  zero  a  solution  to  this  equation  exists. 

The  values  are 

bo  -  y   "  bl* 

b]L  "  (xt   -  x)(yt    -  y) 

(Xi    -"xT7 
where  bQ  and  \>i  are  estimates  of  G    and   A    (Guttman,   I965). 
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Graphically  all  values  of  x  and  y  could  bo  represented  as  a 
straight  line  in  tho  xt  y  plane  with  intercept  bQ,  and  slope  b^.   This 
line  is  called  the  regression  line. 

The  unknown  parameters  can  be  found  using  the  matrix  oquation 

NX  =  -U 
or  X  =  -tr1u 

the  variance  of  unit  weight,  sQ  ,  (or  mQ2§  as  frequently  used  in 
geodetic  literature)  is  given  by 

(28)  s02  =  V'PV 

n-u 

In  the  linear  regression  problem  u  is  2.     From  the  matrix  adjustment 
we  also  form  the  weight  coefficient  matrix,   Qx.      Tho  variance  - 
covariance  matrix  £.,   is  formed  by  multiplying  Qxby  s0.     In  the  linear 
regression 

(29)  ^        =  k02 

These  quantities  are  the  statistics  which  will  be  used  in  the  testing 
of  hypotheses  concerning  this  regression. 

3.3  Tests  of  Hypotheses  in   a  Linear  Regression 

In  most  linear  regressions  tho  estimator  of  greatest  impor- 
tance is  the  slope  of  the  line,  b<.   To  test  if  the  slope  is  signifi- 
cant]^ different  from  some  hypothesized  value,  say  /?  the  t  statistic  ii 
used.  Stating  the  hypothesis 

the  alternate  is 


31 
the  statistic  computed  is 

(30)  t  =  (bt  -/?j') 

The  hypothesis  is  rejected  if 

t  *  fc(l-  /2)  (n-2) 
or 

t  "  "*(i-  /2)      (n-2) 
It  should  be  noted  that  if  y  is  independent  of  x,    ^  will  be  0.     We 
can  test  for  this  by  setting  /^    equal  to  0,    then 

(3D  t;    =  b,-    -  0 

would  be  the  test  statistic. 

Other  hypotheses  concerning  the  linear  regression  are 

M  Ho^o^o' 

(2)  H0   i fa  =/?!«   and/?0=/?0«- 

For  hypothesis  (1)  the  t  statistic  is  used 
(32)  t  =  (bQ  -^00 

The  F  statistic  can  be  used  to  simultaneously  test  hypothesis 
(2).   This  method  will  be  discussed  in  section  3.31. 
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Table  2 
Summary  of  Test  Procoduros 


Hypothosis 

Stati  stio 

nation 

Re  j  oc  ti  on  1  le gi  on 

A,  'A' 

t 

30 

/tl       >     V  CM-  2)  tl-«V«? 

A.--AJ 

t 

32 

|t|       >     "fcCV7_£)c,  -o</2) 

A  -A,' 

and 

F 

3^ 
(soc.  3.31) 

1 

The  mathematical  structure  discussed  is  quite  general.  For 
example 

y  =  A+  /? sin  t 

can  bo  handled  by  the  methods  previously  discussed  simply  by  sub- 
stituting 

x  =  sin  t. 
The  expotontial  problem  u  =  2fo         reduces  to  a  linear  form 

log  u  =  log  V  =  <fv. 
Making  the  following  substitutions 

y  =  log  u 

A  -  loc  * 

X   =    V 

We  have  the  problem  in  the  form 

y  -y?0  +/£1x  (Ostle,    1963). 

As  one  example  of  the  goodotic  applications  of  a  linear 
regression  data  presented  by  Laurdla  ( 1965)  can  be  analyzed.  In  this 
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report,  Dr.  Laurila  assumes  that  tho  relationship  between  absolute 

humidity  and  measuring  error  is  linear.   The  mathematical  structure  used 

is 

Dt  -  D0  =  CA  +  K 

where  D  is  tho  observed  distanco,  D+  the  true  distance  of  the  reference 
o  '   w 

base,  A  denotes  the  absolute  humidity,  C  reprosents  tho  humidity 
coefficient,  and  K  is  an  instrument  constant. 

From  a  least  squares  adjustment  of  25  measurements  the  following 
results  were  obtained  (Laurila,  19&5). 


c 

sc 

K 

Sk 

0.97 

+0.10 

-11.4 

+1.2 

Figure  6  is  a  scatter  diagram  plotted  from  tho  data  given  in  the 
report,  for  tho  reverse  readings. 

Using  the  t  test,  hypotheses  about  this  line  can  be  tested.  It 
should  be  kept  in  mind  that  these  tests  are  conditional  tests.  They 
take  into  account  only  the  effect  of  errors  of  the  parameter  tested. 
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(1)  H0   :    C  =  0 

t  =  0.97  =  9.7 
.10 

*2fc,  5:^  ■  2-°6 

This  hypothesis  can  bo  rejected. 

(2)  HQ    :    K  =  0 

t  =  11.4  =  9.5 
1.2 

*2frf   5^  =  2*06 
This  hypothesis  can  also  be  rejectod. 

From  a  combined  adjustment  (Laurila,   I965)  the  parameters  wore 
found  to  be 

c  =  0.66 
K  =  -6.8. 
Were  these  the  true  parameters  for  the  line  given  by  the  forward  readings 
only? 

(3)  H     j  C  =  0.66 

t  ■  .97  -  .66  =  .31  «  3.1 
.10  .10 

^4,  5%=  2'06 
This  hypothesis  may  also  be  rojocted. 

(4)  HQ  :  K  =  -6.8 

t  -   11.^  -  6.8  s  4^  =  3.84 
1.2     1.2 

Again  this  hypothesis  may  bo  rejected. 

In  a  similar  manner  the  linear  coefficients  obtained  from  a 

least  squares  adjustment  may  bo  tested  for  a  zero  value  or  some 

theoretical  or  previously  determined  value. 
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3.31  Matrix  Approach 

The  simplest  way  to  handle  adjustment  computations  is  to  use 
matrix  algebra.   It  would  be  valuable  to  have  matrix  methods  to  test 
statistical  hypothesis  concerning  the  results  of  an  adjustment.   Hamilton 
(1964)  has  given  such  methods. 

From  an  adjustment  computation,  using  the  methods  outlined  by 
Dr.  Uotila  ^i^66)  the  variance-covariance  matrix^,  the  solution  matrix 
X  and  the  estimated  standard  error  of  one  observation  of  unit  weight 
sQ2  can  be  obtained.   From  the  derivation 

Xx  -  s02  Qx  -  sQ2  N"1 
Hamilton  (1964)  shows  that 

(X  -  XJ'Z"1  (X  -  X) 


is  distributed  as 


^  u/u s  Fu,  n  -  u 


°X\   -  u/Cn  "  u) 


by  matrix  algebra 


-  s2Q 


iCi-2/1  "  1  W- 


Applying  Hamilton»s  test 


so 


H0  :  X  -  XH 


where  X^  is  the  hypothesized  value  of  X,  compute 

(34)        Sh  -  1  (X  -  XH)'  Q-1  (X  -%) 
u    u  s^2 

If  SH  exceeds  Fu>  n  .  u,<*the  hypothesis,  HQ,  may  be  rejected  at  the  ex 
IT" 

level  of  significance. 

Hamilton  (1964)  also  developes  the  theory  needed  to  test 

hypothesis  when  constraints  have  been  placed  on  the  structure. 
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Introducing  Lagrange  multipliers,    K',the  function 

0  =  V'PV  -  2K'    (CX  +  Z) 
is  obtained.      Differentiatj  nn  this  expression 

d0  =  2V'PdV  -  2K'CdX. 
Let  X  denote  the  least  squares  estimate  under  the  constraint 

d0  =  2  f-U  +  NX  -  K'CjdX. 
Minimizing  the  residuals 

0  =  2  [-U  +  X'N  -  K'CjdX 
so 

K'C  ■  -U'   +  X'N 
Substituting 

U»   =  X*'N 
where  X*  is  the  best  least  squares  solution  without  conditions,  the  equa- 
tion becomes 

K'C  =  (X  -  X*)'N 
KCN"1    c'   b  XC1    -  X*1    C   =  Z'    -  X*C» 
thus 

K  =(Z'    -  X*C)   (C*N     C1)"1 
eliminating  K  from  the  preceding  two  equations  we  obtain 

(X  -  X*)'N  =  (Z  -  X*C»)   (CN"  C')"lC 
or 

(35)  X'  =  X*'  +  (Z  -  X*'C)  (cn"  c»  )"1cn" 

The  weighted  sum  of  the  squaros  of  the  residuals  is  given  by 

RQ  b  V'PV  +  (X  -  X*)'(A'PA)  (X  -  X*) 
V  is  the  matrix  of  residuals  without  the  constraints. 


33 
The  expected  value  of  Rq  is   (n  -  u  +  b)  GJ    whore  b  is   tho  nurabor  of 
conditions.      The  expression  may  be  rewritten  as 


where 


RU  =  RQ  -  R0 


RQ  =   V'PV 


That  is,  RQ  is  tho  unconditional  least  squares  sum  and  R..  is  the  addi- 
tional sum  of  squares  due  to  the  constraints.  Hamilton  (196^-)  shows 
that 

RH  =  (Z  -  CX*)'  (cU'paHc1)"1  (Z  -  CX*). 
The  ratio 


F 
is  distributed  as 


Ro     Ro 


b    F 


n  -  u 


Substituting  tho  values  of  R„  and  R 

Ik  -  (z  -  ex*)1  (cU'PArVr1  (z  -  ex*). 

R0  (n  -  u)s2 

but 

-1 


A' PA  =  N  =  X 
s^         s^ 

the  variance-covariance  matrix. 

Thus 

(36)  n  -  u  RH  =  (Z  -  CX*)'    (CZC1)-1    (Z  -  CX*) 

b      "  R0  b 

which  is  distributed  as  F^   n_Upo      The  hypothosis  can  be  rojoctod  if 

the  computed  value  of  this  quantity  exceeds  the  tabulated  value  of 

Fb,   n-u,c*  • 
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T 
If  a  theoretical  model  has  parameters  X*  the  hypothesis 

HQ  :  X  -  XT 

that  is 

T 
xl  -  xl 

T 
x2  "  x2 


xi  -  xi 

The  matrices  C  and  W  are 

uCn  -  I  and  uZl  -  XT 

thus  the  statistic  to  be  tested  is 

(X*  -  XT)'  QQ~l   (X*  -  XT)  -  £h 
uTZ  u 

This  is  a  more  general  derivation  of  the  statistic  offered  earlier. 

To  test  a  hypothesis  concerning  the  value  of  a  single  parameter, 
regardless  of  the  values  which  the  other  parameters  have,  the  same  pro- 
cedure would  be  followed. 

HQ  :  xL  -  xLT 

C  -  (1,  0,  0 0) 

T 


W  -  xt 


Then  if  the  hypothesis  is  true 


(n  -  u)  RH  -  (xt*  -  x^)2 

r;  sT2 

9 
where  Sj   is  the  variance  of  Xp  the  statistic  (n  -  u)  ft, 

*o 

is  distributed  as  Flt  n.u,/.#   This  is  simply  the  square  of  the 
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Students  t  for  (n  -  u)  degrees  of  freedom. 

For  a  more  detailed  discussion  of;  the  matrix  approach  the  reader 
is  referred  to  chapter  4  of  Hamilton  (1964). 

To  illustrate  the  matrix  approach,  the  data  of  Hatch  (1964) 
will  be  used. 


15.00 

0.00 

-0.10 

-0.01 

0.01 

-0.20 

-0.03 

0.00 

7.66 

0.13 

-0.20 

-0.01 

0.02 

0.03 

-0.10 

0.13 

7.42 

0.05 

0.00 

0.02 

0.01 

N  - 

-0.01 

-0.20 

0.05 

7.31 

0.00 

0.03 

0.00 

0.01 

-0.01 

0.00 

0.00 

0.03 

0.00 

0.00 

-0.20 

0.02 

0.02 

0.03 

0.00 

2.32 

0.00 

-0.03 

0.03 

0.01 

0.00 

0.00 

0.00 

0.27 

Other 

data  are 

s  -  2.44  4.05 

s2  -  5.95  -2.46 

u  -  7  5.71 

'  -  1.74 

1.00 

-5.30 

-0.12 

For  the  sake  of  an  example,  assume  that  some  theoretical  consid- 
erations predict  X^  to  be: 


xT  = 
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3.66 

0.00 

6.83 
o.oo 

0.00 

0.36 

0.00 


We  wish  to  test  the  hypothesis 


Ho  :  X  =  XT. 


The  computed  values  are 


0.39 
-2.46 
-1.12 

(X*  -  XT)=    1.74 
1.00 
-5.66 
[-O.12J 
sH  =  (x*  -  xT)«  Q"1   (X*  -  xT)  =  158.79  =  3.81 

u  us2  41.65 

From  the  tables  for  F  (Appendix  IV) 

F7.  8,  5£  =  3'5° 
Since  S„    y      **?     8     ^»   ^e  hypothesis  may  be  rejected  at 


the  5$  significance  level.      Rejoction  at  this  level     of  significance  is 
termed  significant.     Can  the  hypothesis  also  be  rejected  at  the  1$ 
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level? 

?7,  8,  17.  -  6»84   . 

Su  .  F7   s   17.  so  the  hypothesis  may  not  be  rejected  at  the  17. 
u  ^ 

significance  level. 

3.4  Complex  Models 

In  many  cases  a  simple  two  dimensional  linear  mathematical 
structure  will  not  properly  represent  the  given  data.   It  may  be  that 
a  polynomial  of  increasing  order 

will  better  represent  the  true  structure.   The  matrix  least  squares 
solution  in  procedure  is  identical  to  the  previous  case.  Using  the 
solution  matrix  (bQ,  b^  ...  b  )  and  the  weight  coefficient  matrix  (Q) 
statistical  hypothesis  may  be  tested  using  the  t  tests  previously  shown. 
The  hypothesis 

»o  s  A  -  °»  <A  -/i'»/2  -/2'  .../?n  vV> 

is  tested  by  computing 

t  -  bj2   -  b£ 


soVqTi   si 


and  tested  against 


Hi   -cx/2)(n  -  u)  . 
The  hypothesis  may  also  be  tested  by 


F  -  bj2 


so/(Hi 


These  F  tests  serve  to  assess  the  significance  of  the  additional 
reduction  in  the  residual  sum  of  squares  achieved  by  fitting  b»s  in  the 
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particular  order  adoptod.      Tho  order  of  fit  is  important.      In  a 
polynomial  this  ordor  is  fixod.     In  other  structures  tho  order  is  deter- 
mined by  the  oquations  of  tho  mathematical  structure. 

Then  fitting  polynomials  Ostle  (1963)  recommends, 

"Rather  than  seek  a  bettor  fit  in  terms  of  a 
higher  degree  polynomial  (i.e.   a  degree  greater 
than  2)  it  is  probably  better  to  cast  about  for 
somo  other  functional  form  to  represent  the  data." 

An  excellent  example  of  regression  analysis  applied  to  a  gravity 
problem  is  given  in  the  article  by  H,  Wolf  (1965).     In  this  paper  Dr. 
Wolf  uses  hypothesis  tests  to  determine  the  systematic  trend  of  gravity 
differences  along  tho  European  Calibration  Lino, 

It  is  evident  that  the  matrix  approach  discussed  in  Section 
3.31  cftn  be  easily  applied  to  complex  models. 

Application  of  hypothesis  tests  to  a  non-linear  model  raises 
another  interesting  problem.     It  cannot  be  shown  that  tho  non-linear 
least-squares  solution  will  always  converge  to  even  a  local  minimum 
value  of  the  weighted  sum  of  the  residuals,   V* PV.      This  problem  is  dis- 
cussed by  Hamilton  (1964).      He  concludes, 

"If  the  estimated  errors  are  small  enough  that 
the  functions  are  truly  linear  over  tho  range 
of  several  standard  deviations  in  each  parameter, 
the  methods  of  testing  linear  hypothesis   , , , 
can  be  applied  in  the  same  way," 

When  individual  parameters  are  tested,   either  by  the  t  or  F 
tests  previously  demonstrated,  any  co-variance  betwoon  tho  parameters 
is  not  considered.     If  the  parameters  were  truly  independent  tho  choice 
between  individual  tests,   i.e.,    the  t  tost,   and  the  simultaneous 
matrix  tests  would  be  one  of  individual  preference. 

If  the  co-variance  between  parameters  is  largo,    the  simul- 
taneous matrix  test  should  be  used. 
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Example:     In  his  work  with  tho  MRA-1   Tolluromoter,   Hatch  (1964) 
investigated  the  mathematical  structure 

T-B  =  bj  +  b2  sin  (2  )r  (A  +  be))   +  b^  sin 

Too 
+  (l12T  ^A  +  b6^  +  b^  sin  *£?"  ^A  +  b7^* 

100  100 

After  least  squares  ad jus tmont, results  for  January  19  were: 

b(l)=  2.78  +  1.29 

b(2)=  0.32  £  1.83 

b(3)=  10.09  ±  1.82 

b(4)=  -2.57  +  1.80 

b(5)=  -0.31  ±  89.63 

b(6)=  23.02  ±  1.45 

b(7)=  -0.06  +  2.92 
He  made  16  observations  thus 

(n  -  u)  a  16  -  7  =9 
degrees  of  freedom. 
To  test 

Ho  :  /?■  -  0 
t.  is  computed  from      t  a  b  /s-u 

tx  a  2.78  a  2.15 
1.29 

t2  =   0^   a   0.175 

1.83' 


t-i  -   16.09  =  5.5^ 
J    1.82 

V=  2^7.  =  1.^3 


1.80 


h', 


tr  =    0.31  =    0.0035 


t6  =  23.02  =  15.90 
t7  =    0.060^    0.0206 

from  the  table 

tc,,  5*-  2-26' 

thus  the  only  parameters  which  arc  significant  at  the  5$  level  are  b-j 

and  b£. 

At  the  1$   level 

*9,  I*"  I'25 
therefore  b^,  bo,  and  b^  are  significant  at  this  level. 

These  tests  at  a  significance  level  of  1$  give  statistical  sup- 
port to  the  conclusions  drawn  by  Hatch  (196*0  that  the  error  can  be  rep- 
resented by 

E  -   bj  +  b^  sin  tofCA  +  b^). 
100 

To  go  a  step  farther  with  tho  data  given  by  Hatch,  the  hypoth- 
esis 

H  t  %  a  0,  i  =  1  to  ? 
was  tested  on  each  set  of  observations  using  the  t  test.   Table  3  shows 
the  results  (Hatch,  196*0. 


U6 


Tablo  3 


Test  of  HQ  :  B.  =  0 


Date/ 

19 

21-22 

8 

W 

15 

17 

Kumbcr  of 

Parameter 

Jan. 

Jan, 

Feb. 

Karen 

April 

April 

Rejections 

Bi 

A* 

A 

R 

R 

A 

R 

3 

B2 

A 

A 

R 

A 

R 

R 

3 

B3 

R 

A 

R 

R 

A 

R 

4 

% 

A 

A 

R 

A 

A 

A 

1 

B5 

A 

A 

A 

A 

A 

A 

0 

B6 

R 

A 

A 

A 

A 

R 

2 

B7 

A 

A 

R 

A 

A 

A 

1 

*  A  signifies  acceptance,  R  signifies  rejection. 

If  the  parameters,  which  rejected  the  hypothesis  more  than  once 
in  the  six  sets  are  used,  the  formula  for  the  cyclic  zoro  error  would  be 

E  =  b1  +  b2  sin  2tt  A  +  bo  sin  JV7T   (A  +  b^)  . 

100  *  100 

As  a  comparison  of  the  individual  t  test  and  the  simultaneous 
matrix  test  consider  the  following  data  taken  on  23  March  by  Hatch 

(196*0. 

2 
n  ■  25,  u  =  7,  sQ  =  +  3.03,   sQ    =  9.19 

B(l)  =      3.66  +      0.65 

B(2)  ■    -1.67  +      0.87 

B(3)  =      6.83  +      0.93 

* 

B(^)  =  1.75  +  0.88 

B(5)  =  21.15  +  15.39 

B(6)  =  0.36  +  1.32 

B(7)  =  1.63  i  ^.79 
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N  ■ 


T*m 


25.00 

-2.95 

-0.26 

2.58 

-0.27 

-0.05 

0.11 

-2.95 

13.97 

-2.82 

-2.23 

-0.02 

1.60 

0.17 

-0.26 

-2.28 

11.62 

-1.58 

0.01 

0.18 

0.14 

2.58 

-2.23 

-1.58 

13.05 

0.01 

-1.92 

-0.11 

-0.27 

-0.02 

0.01 

0.01 

0.04 

-0.08 

-0.01 

-2.05 

1.60 

0.18 

-1.92 

-0.08 

6.16 

-0.28 

0.11 

0.17 

0.14 

-0.14 

-0.01 

-0.28 

0.42 

verso 

matrix  is 

given  as: 

0.0^ 

0.01 

0.00 

-0.01 

0.32 

0.62 

-0.00 

0.01 

0.08 

0.02 

0.01 

0.05 

-0.02 

-0.00 

0.00 

0.02 

0.0Q 

0.01 

-0.01 

-0.01 

-0.04 

-0.01 

0.01 

0.01 

0.03 

0.00 

0.02 

0.03 

0.32 

0.05 

-0.01 

0.00 

2?,8? 

0.45 

0.72 

0.02 

-0.02 

-0.01 

0.02 

0.45 

0^2. 

0.14 

0.00 

-0.05 

-0.04 

0.03 

0.72 

0.14 

2,^ 

From  the  previous  t  tests  one  would  expect  that  B(4),  B(5)» 
and  B(7)  might  be  equal  to  zero.  Constraining  all  values  to  bo  equal 
to  their  test  value,  the  C  matrix  is  equal  to  I,  and  Z  is  equal  to  the 
test  matrix,  X™.   The  hypothesis 


where 


H  :  X=  XJ 
o 


3.7 
-1.7 
6.8 
0.0 
0.0 
0.4 


0.0 
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will  bo  tostod.   Usinr,  equation  (36) 

£h   =  (x*-xT)':i  (x*-x?)  =  44.697  =  0*695    . 

u  us  (7)(9.19) 

o 

Since  SH      ^      F?fl8i5^=2.58  , 

u 
the  hypothesis  is  acceptable   The  values  of  B(4),  B(5),  and  B(7)  are 
probably  equal  to  zoro,  which  is  the  same  conclusion  reached  by  the  t 
test.  The  values  given  for  B(l),  B(2),  B(3)t  and  B(6)  are  not  neces- 
sarily the  final  values.  The  new  structure,  containing  only  these  para- 
meters, must  be  solved  by  another  least  squares  adjustment.  For  the 
data  used  previously  the  results  of  this  readjustment  and  the  change 
from  the  values  given  by  Hatch  are 


Valuo 

Change 

B(l) 

3.^5 

.21 

B(2) 

-1.80 

-.13 

B(3) 

6.73 

.10 

B(6) 

0.28 

.16 

Statistical  tests  could  also  be  applied  to  these  values. 

The  previous  example  could  also  have  been  tested  using  the  "R 
factor"  test  discussed  by  Hamilton  (1964).  This  is  a  test  of  the  ratio 

Rq 

n  »  F°r  ^h®  details  the  reader  is  reforred  to  page  157  of  Hamilton. 
Ro 

To  point  up  the  difference  between  the  individual  tests  and  a 

simultaneous  test,  the  data  of  Laurila  (1965)»  shown  in  Figure  6,  Section 

3.3  of  this  thesis  will  be  used. 

For  this  example  assume  that  some  theoretical  consideration 

predicts  that  the  values  are 


Using  the  t  tost 
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c« 

=  0.206 

K' 

=  2.10 

• 

tK, 

■ 

2.10  n  2 
1.2 

.06 

*c-  = 

0 

,206  =  2. 

06     . 

0, 

,10 

From  tho  tables,  t2/j.  5$  is  2.06;  theroforo,  thoso  valuos  of  K'  and  C 
aro  acceptable. 

The  values  of  K'  and  C!  can  then  be  used  as  tho  test  values  for 
a  simultaneous  F  test. 


(X*  -  X1)  ■ 


F  =[0.454   -4.7] 


0.454 
-4.70 
25.0     272.4 
.272.4    3824.28 


0.454 


(2)(0.2129) 
F  =  195.683. 
The  tabulated  value  of  Tf^Qyfc  is  2.58,thus  tho  hypothesis  can  bo  re- 
jected although  the  test  valuos  are  acceptable  on  an  individual  basis. 
One  would  expect  this,  because  the  Q      matrix  indicates  a  strong  corre- 
lation between  parameters. 
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CHAPTER  fr 

LEVEL  OK  SIGNIFICANCE 
Tho  selection  of  tho  significance  lovol  to  bo  used  in  a 
statistical  tost  is  a  matter  of  judgement  and  experience  on  tho  part  of 
the  geodosist.  It  may  bo  chosen  before  the  data  is  analyzod.   Basi- 
cally tho  problem  is;  how  largo  should  the  critical  region  for  rejoction 
bo,  or  what  is  the  risk  of  committing  a  Type  I  error  which  the  geodesist 
is  willing  to  accept?  Dixon  (1957)  states, 

"If  it  is  a  mattor  of  great  concern  when  a  true 

hypothesis  is  rejected, ex  should  be  small  

if  it  is  a  matter  of  groat  concern  that  a  hypothe- 
sis bo  rejectod  if  thoro  is  little  evidence  against 
it  we  should  use  a  largo  ch.  ". 

A  convention  followed  by  statisticians  is  tho  following:  If  a 
hypothesis  is  rejected  at  fr  =  5$  it  is  said  to  be  significant.  A 
hypothesis  rejected  at  oC   =  l$b  is  said  to  be  highly  significant. 

When  chosing  a  significance  level  for  a  test,  one  must  keep  in 
mind  that  the  acceptance  of  a  hypothesis  is  favored  over  rejoction  of  an 
alternative  by  any  test.  If  an  <X  of  1$  is  chosen,  the  critical  area 
for  rejection  is  small,  thus  rejection  is  less  likely  than  if  a  5$ 
level  of  significance  is  chosen. 

The  choice  between  c<  =  5$  a"d  c<  =  1$  must  bo  dictated  by  the 
circumstances  surrounding  the  problem. 

An  entirely  different  approach  to  hypothesis  testing  can  be 
developed  by  using  a  slightly  different  procedure.  In  this  method  the 
significance  level  is  not  pre-selectod.  The  tost  statistic  is  computed 
by  one  of  the  methods  previously  aiscussod. 

The  probability  (found  in  the  tables)  associated  with  tho  value 
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yieldod  by  tho  data  is  taken  as  an  objective  measure  of  the  dogreo  of 
support  that  tho  data  londs  to  the  hypothosis.   Taking  this  approach 
the  test  of  significance  does  not  lead  to  tho  acceptance  or  rcjoction  of 
the  hypotheses;  it  merely  measures  the  strongth  of  bolief.  In  many 
problems  encountered  in  geodesy  and  photogrammetry  this  may  bo  the 
bettor  approach. 

Examplo:  As  a  final  stop  in  his  analysis  Hatch  (196*0  rep- 
resented each  of  his  regression  coefficients  by  a  linear  regression 

bj  =  Mj_  o  +  c^ 
b2  =  M3  P  +  C3 
b6  =  M6  e  +  C6 

Where  e  is  the  partial  vapor  pressure,  and  P  is  tho  baramotric 
pressure. 

3y  a  least  squares  adjustment  he  obtained  the  following: 
(P.  66) 


Value 

Standard  Error 

Ml 

*H.53 

9.1 

Cl 

-8.08 

2.0 

M3 

-12.09 

40.4 

c3 

359.69 

118.1 

M6 

-144.82 

30.8 

c6 

4-3.80 

6.8 

The  t  statistic  can  be  used  to  determine  the  significance  of 
each  co-efficient 


t   "  2L 


rjZ 


t 

Dogroo  of  Freedom 

Degree  of  Support 

Ml 

4.57 

4 

98,: 

cl 

4.03 

4 

9c; 

M3 

.30 

4 

^  50* 

c3 

3.04 

4 

95* 

H 

4.71 

4 

99* 

% 

6.45 

4 

992 

It  can  bo  concluded  that  thero  is  little  relationship  between 
b2  and  P  since  Mo  is  significant  at  a  lovel  of  less  than  50'J>. 
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CHAPTER  5 

CONCLUSIONS 

This  thesis  has  presented  tho  statu stical  theory  basic  to 
hypothesis  tests.  Some  of  the  statistics  commonly  used  in  hypothesis 
tests  have  boen  discussed,  and  their  applications  to  the  problems  of 
geodesy  demonstrated. 

The  tests  most  applicable  to  gcodotic  problems  are  obviously 
the  ones  which  use  a  statistic  that  does  not  require  the  true  variance 
in  its  computation.  Thus  the  Student's  t  statistic  and  the  F  statistic 
are  more  useful. 

The  t  statistic  is  used  to  test  the  moan  of  a  population  with 

an  estimated  standard  error,  s,  and  a  finite  number  of  observations,  n, 

against  some  other  mean  value. 

t  =  (X  -  /4)  n2" 
so 

Fisher's  F  statistic  is  given  as 

F  =  st2 
s2 

It  can  be  used  to  test  hypothesis  concerning  the  variance  of  two  inde- 
pendent sets  of  data. 

From  the  adjustment  procedures  outlined  by  Dr.  Uotila  (1966), 
all  the  values  needed  to  compute  tests  statistics  are  available.  In 
most  cases  computations  of  the  test  statistic  nood  only  be  carried  to 
three  significant  figures.  This  precision  can  be  obtained  on  an 
ordinary  slide  rule. 

By   using  matrix  methods  tho  dosired  test  statistic  can  be  com- 


puted,  during  the  adjustment  solution,  by  an  eloctronic  computer.   The 
simultaneous  matrix  tost  is  prof or rod  ovor  tho  individual  t  tost  in 
cases  whore  there  is  a  largo  correlation  botweon  paramotors.   The  person 
analyzing  the  data  need  only  compare  the  computed  statistic  with  the 
tabulated  values  to  make  tho  hypothesis  tests. 

The  purpose  of  using  statistical  hypothesis  tests  in  goodosy  and 
photogrammetry  is  to  guide  tho  user  to  tho  bost  conclusions  based  on 
the  data  analyzed.  It  should  bo  emphasized  that  failure  to  roject  a 
hypothesis  doos  not  moan  tho  hypothosis  is  truo.  If,  on  tho  basis  of  a 
test,  a  hypothosis  is  rejectod  the  statement  can  be  mado  that  there  is 
evidence,  from  the  data  analyzed,  that  the  hypothosis  is  not  true. 
Conclusions  drawn  with  the  aid  of  statistical  tests  are  thus  supported 
by  probability  theory,  in  addition  to  the  judgement  and  experience  of 
the  scientist. 
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APPENDIX  I 
CUMULATIVE  NORMAL  DISTRIBUTION 
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Appendix  I 
Cumulativo  Normal  Distribution 


Volws  of  Y  =    fX    <!,(r)  dx  for  A'  =  0.00[0.01  |2.99 


A  » 
1 

0.00 

0.01 

0.02 

0.03 

0.01 

0.05 

0.00 

0.07 

O.OS 

0.00 

O.C 

.:.ooo 

.5040 

,5080 

.5120 

.5159 

.5109 

.5239 

.5270 

.5319 

.5350 

(i.l 

.5308 

.5438 

.5478 

.r,;,17 

.5557 

..-,."00 

.5636 

.,-,075 

.571  1 

.5753 

0.2 

.5793 

,5832 

.5871 

,5910 

.5048 

.5987 

.0020 

.000  1 

.0103 

.01  11 

0.3 

.017!) 

.6217 

.6255 

.0293 

.6331 

.6308 

.0100 

.0  1  13 

.6480 

.(',.-,17 

0,1 

.6554 

.6501 

.6628 

.6664 

.1,700 

.6736 

.0772 

.0808 

.681  1 

.6870 

0.5 

.6915 

.6950 

.6985 

.7010 

.70.",  l 

.7088 

.7123 

.7157 

.7100 

.7221 

O.C 

.  7 '_'.-)  7 

.7291 

.7324 

.7357 

.7380 

.7-122 

.7154 

.7  ISO 

.7517 

.7510 

0.7 

.7580 

.7611 

.7642 

.7073 

.770  1 

.7734 

.770  1 

.7701 

.7823 

.7852 

0.8 

.7881 

.7910 

.7939 

.7007 

.700.-, 

.8023 

.8051 

.8078 

.8106 

.SI  33 

0.0 

.8159 

.SI  SO 

.8212 

.8238 

.826 1 

.8289 

.8315 

.8310 

.8305 

.8380 

1.0 

.8413 

.8438 

.SOU 

.sis:, 

.S50S 

.8531 

.855 1 

.8577 

.8599 

.8621 

1.1 

.8643 

.8665 

.8686 

.8708 

.8720 

.8749 

.S770 

.8700 

.8810 

.8830 

1.2 

.8840 

.8869 

.8888 

.8907 

.8025 

.soil 

.8962 

.8080 

.8997 

.9015 

1.3 

.0032 

.90  19 

.9066 

.9082 

.0000 

.01  15 

.9131 

.01  17 

.9102 

.0177 

1.4 

.0102 

.0207 

.0222 

.9236 

.02.-,  1 

.020.-, 

.0279 

.0202 

.0:i00 

.0310 

1..-) 

.0332 

.9345 

.9357 

.9370 

.9382 

.9394 

.0  010 

.9118 

.0  120 

.0  1  11 

1.6 

.0452 

.9163 

.0171 

.0484 

.01".", 

.9505 

.9515 

.0525 

.o:,:',5 

.9515 

1.7 

.055 1 

.9564 

.9573 

.0582 

.9501 

.9599 

.9608 

.01,10 

.9025 

.9633 

1  8 

.0641 

.96  19 

.9656 

.0661 

.9671 

.007S 

.OOSI, 

,9693 

.9699 

.0700 

1.0 

.9713 

.9719 

.0720 

.9732 

.9738 

.0711 

.0750 

.0750 

.0701 

.0707 

2.0 

.9772 

.9778 

.0783 

.9788 

.9703 

.0798 

.9803 

.9808 

.OS  12 

.9817 

2.1 

.9821 

.0826 

.9830 

.9834 

.0838 

.OS!  2 

.9846 

.9850 

.0S5  1 

.9857 

2.2 

.9861 

.0864 

.9868 

.9871 

.9875 

.0S7S 

.0881 

.9884 

.0SS7 

.9890 

2.3 

.9803 

.9896 

.9898 

.0001 

.0001 

.0000 

.0000 

.0011 

.901.'! 

.0010 

2.4 

.9918 

.0920 

.0022 

.0025 

.0027 

.0020 

.0031 

.9032 

.90:;  1 

.9936 

2.5 

.9938 

.0010 

.9911 

.9943 

.001.") 

.0010 

.00  IS 

.00  10 

.9951 

.00.",  2 

2.6 

.9953 

.9955 

.9956 

.9957 

.9959 

.0!  it  ill 

.0001 

.0002 

.9!  10:; 

.0001 

2.7 

.9965 

.0000 

.0007 

.9968 

.0000 

.0070 

.0071 

.0072 

.9973 

.9971 

2.8 

.9974 

.007". 

.01170 

.0077 

.0077 

.9978 

.0070 

.9070 

.90SO 

.9981 

2.0 

.9981 

.9082 

.9982 

.9983 

,9984 

.OOSI 

.00S5 

.O0S5 

.0086 

.9986 

Taken  from  Appendix,  Table  I  (Hamilton,  196*0, 
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APPENDIX  II 
STUDENT'S  t  DISTRIBUTION 
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Used  in  conjunction  with  problems  in  statistics,  the  tabic  of 
the  Student* s  t  distribution  permits  the  evaluation  of  deviations  ex- 
pressed in  terms  of  estimates  of  standard  errors  for  samples  of  various 
sizes.  A  given  estimate  of  standard  error  is  divided  into  a  difference 
or  deviation,  to  obtain  t  as  a  basis  for  a  test  of  significance.   The 
table  is  entered  with  the  number  of  degrees  of  freodom  determined  for 
the  problem.  The  tabular  entry  in  the  column  is  the  value  of  t 
associated  with  the  probability  level  indicated  at  the  top  of  the 
column.   This  level  expresses  the  probability  of  obtaining  a  difference 
as  large  as  the  one  obtained  due  to  chance.  (Arkin  and  Colton  1959). 
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Student's  t  Distribution 


Degrees  of 
I'Venlom 


Probability 


0.50 


0.10 


0.05 


0.02 


0.01 


1 

2 

:t 

4 
5 
0 
7 
8 
9 
10 

11 
12 
13 
14 
15 
16 
17 
18 
19 
20 

21 
22 
23 
24 
25 
26 
27 
28 
29 
30 

35 

40 

45 

50 

00 

70 

80 

90 

100 

125 

150 

200 

300 

400 

500 

1000 

CO 


1  .  000 

0  816 

.  7C6 

.711 

.727 
.718 
.711 
.700 
.703 
.700 

.697 
.695 
.  69 1 
.092 
.691 
.690 
.689 
.688 
.688 
.687 

.686 
.686 
.685 
.  685 
.684 
.684 
.684 
.683 
.683 
.  683 

.682 
.681 
.680 
.679 
.678 
.(178 
.077 
.077 
.077 
.  676 
.676 
.075 
.675 
.075 
.674 
.674 

.674 


6.34 

2  92 

2. 13 
2.02 
1.94 
1.90 
1  86 
1 .  83 
1.81 

1.80 
1.78 
1.77 
1.76 
1.75 
1  75 
1.74 
1  73 
3 


1 
1.72 

1.72 
1.72 
1.71 
1.71 
1.71 
1  71 


1.70 

1.69 

1  OS 
1 .  68 
1  .08 
1  07 
1  07 
1  .66 
1 .66 
1 .  66 
1.00 
1.66 
1  65 
1 .  65 
1  65 
1 .65 
1.05 

1  64 


12  71 
■1  30 
8, 18 
2  78 
2  57 
2  45 
2.36 
2.31 
2.20 
2.23 

2.20 
2.18 
2.10 


2.09 
2.09 

2.08 
2.07 
2.07 
2.06 
2.06 
2.06 
2.05 
2.05 
2.04 
2.04 

2  03 
2.02 
2.02 
2.01 
2  00 
2  00 
1  .99 
1  99 
1  .  98 
1  98 
1.98 
1.97 
1.97 
1  .97 
1  96 
1.96 


1.96 


11 

82 

0 

96 

•1 

64 

3 

75 

3 

30 

3 

14 

3 

00 

2 

90 

2 

82 

2 

70 

2 

72 

2 

68 

2 

65 

2 

62 

2 

00 

2 

58 

2 

57 

2 

55 

2 

54 

2 

53 

2 

52 

2 

51 

2 

50 

2 

49 

2 

48 

2 

IS 

2 

47 

2 

47 

2 

46 

2 

46 

2 

44 

2 

42 

2 

41 

2 

40 

2 

39 

2 

38 

2 

38 

2 

37 

2 

36 

2 

36 

2 

35 

2 .  35 

2. 

34 

2 

34 

2. 

33 

2 

33 

2.33 


63  6G 
9  0? 

■1  60 
■1  03 
3  71 
3.50 
3  36 
3.25 
3.17 

3.11 
3  06 
3  01 
2  98 
2  95 


92 
90 
88 
86 
84 


2.83 
2.82 
2.81 
2.80 
2.79 
2.78 
2  77 
2  76 
2.76 
2.75 


2.72 
2  71 
2  69 
2.68 
2.00 
2 .  65 
2 .  64 
2.63 
2.63 
2  62 
61 
60 


2 
2 
2  59 


59 
59 

58 


2  58 


•  The  greater  portion  of  thin  table  taken  from  R    A    Fisher's  "Sttttistirnl  Method]  for  Research 
Workers,"  with  the  permission  of  the  author  anil  his  publishers,  Oliver  and  Uoyd,  London 

Source:  Reproduced  by  permission  from  C.  II.  Ooulden,  Methods  of  Statistical  Analy"  (New 
York:  John  Wiley  a\  Sons,  193'J). 


Taken  from  (Arkin  and  Colton  1959) 
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APPENDIX  III 
TABLE  OF  CHI-SQUARE 


61 
The  tabic  of  Chi-squaro  is  cntorod  with  tho  dogrees 
of  freedom  appropriate  to  tho  problem.   Hie  row  for  the  specified 
degrees  of  frocdom  is  followed  across  to  tho  columns  corresponding  to 
c</2  and  1  -  oi/Z   whero  the  thoorotical  values  of  Chi- square  needed  for 
tho  test  are  foimd, 
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Percentage  Points  of  the  x2  Distribution*! 
Values  of  y?„.„,  where  «  is  the  probability  thai  \"  exceeds  v., 


ind 


f, 


<Mv2)  <h7  =  l-aj 


o.ooa 

0.000 

0.075 

(i  ,080 

0.500 

(1.(150 

0.028 

0.010 

0.005 

1 

0.00  + 

0.00  + 

0.00  f 

0.00+ 

0.45 

3.84 

5.02 

6.63 

7.88 

o 

0.01 

0.02 

0.05 

0.10 

1.39 

5.99 

7.38 

9.21 

10.00 

3 

0.07 

0.11 

0.22 

0.35 

2.37 

7.81 

9  35 

1 1 .34 

1 2.8  1 

•1 

0.21 

0.30 

0.  IS 

0.71 

3.36 

9.19 

11.1  1 

13.28 

1  l.so 

5 

0.41 

0.55 

0.83 

1.15 

4.35 

11.07 

12.83 

15.09 

10.75 

6 

0.68 

0.87 

1.21 

1.01 

5.35 

12.59 

14.45 

id. si 

18.55 

7 

0.99 

1.21 

1 .0!) 

2.17 

0.35 

14.07 

16.01 

IS.  IS 

20.28 

8 

1.34 

1.65 

2.  IS 

2.73 

7.3  1 

15.51 

17.53 

20.09 

21.00 

9 

l.7:i 

2.00 

2  70 

3.33 

8.34 

10.92 

19.02 

21.07 

23.59 

10 

2.10 

2.56 

3.25 

3.91 

9.34 

18.31 

20.  IS 

23.21 

25.19 

11 

2.00 

3.05 

3.82 

1.57 

10.31 

19.68 

21.92 

21.72 

20.70 

12 

3.07 

3.57 

4.40 

5.23 

11.31 

21.03 

23.34 

20.22 

28.30 

13 

3.57 

4.11 

5.01 

5.89 

1  2.34 

22.36 

21.71 

27.09 

20.S2 

11 

4.07 

4.66 

5.03 

0.57 

13.34 

23. OS 

20. 1 2 

29. 1 4 

31.32 

15 

4.60 

5.23 

0.27 

7.20 

1  1.34 

25.00 

27.49 

30.58 

32.80 

16 

5.14 

5.81 

6.91 

7.00 

15  ::  I 

26  30 

28.85 

32.00 

34.27 

17 

5.70 

0.41 

7.50 

8.67 

10..;  I 

27.59 

30.19 

33.4  1 

35.72 

18 

6.26 

7.01 

8.23 

9.39 

1 7.34 

28.87 

31.53 

34.81 

37.10 

19 

6.84 

7.63 

8.01 

10.12 

18.34 

30. 1  1 

32.85 

30.19 

38.58 

20 

7.43 

8.20 

0.50 

10.85 

19.34 

31.41 

34.17 

37.57 

40.00 

25 

10.52 

11.52 

13.12 

14.61 

24.34 

37.05 

40.65 

44.31 

40.93 

30 

13.79 

14.0") 

10.70 

18.49 

29.3  1 

4I1.77 

46.98 

50.89 

53.07 

40 

20.71 

22.10 

24.43 

20.51 

39.34 

55.70 

59.3  1 

63.69 

00.77 

50 

27.0!) 

20.71 

32.36 

34.76 

49.33 

07.50 

71.42 

70.15 

70.19 

60 

35.53 

37.  IS 

40.48 

43.19 

59.33 

70.0S 

83.30 

88.38 

91.95 

70 

43.28 

45. 1 1 

48.70 

51.71 

69.33 

00.5:', 

95.02 

100.42 

104.22 

80 

51.17 

53.5  i 

57.15 

60.39 

79.33 

loi. ss 

100.03 

112.33 

110.32 

90 

50.20 

61.75 

65.65 

69.13 

89.33 

113.1  1 

118.11 

124.12 

1 28.30 

100 

07.33 

70.00 

74.22 

77.93 

99.33 

124.34 

129.50 

135.81 

140.17 

Adapted  from  the  tables  prepared  by  Catherine  M.  Thompson  for  Biometrika, 
vol.  32;  reproduced  with  permission  of  the  editors  of  Biometrika. 

t  For  more  than  100  decrees  ,,f  freedom,  percentage  points  x2.,.a  of  the  x2  distribution 
jnay  be  obtained  from  the  two-tailed  percentage  points  Xp  of  the  normal  distribution 
>>y  the  approximate  relation,  y„,a  ss  n  +  (2a)'  2X,>,  with  a  =  /'. 

to  is  thus  the  probability  in  one  tail  of  the  distribution. 


Taken  from  Appendix  Table  III  (Hamilton,    1964) 
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APPENDIX  IV 
TABLE  OF  F 


64 
"  In  the  tablo  of  F  tho  distribution  of  F  is  tabulated  for  tho  5  > 
and  \i»   rosidual  lovcls,   Tho  5^  lovel  valuo  of  F  is  indicated  in  ordinary 
typo*  a"d  th©  1$  lovol  figuros  aro  printed  in  bold  faco  typo.  Entry  in- 
to tho  tablo  is  accomplished  by  reference  to  tho  appropriate  column  for 
tho  degreos  of  froodom  associated  with  th©  greater  variance,  and  to  tho 
appropriate  row  for  th©  degrees  of  frocdom  associated  with  tho  smaller 
variance.  If  th©  calculated  ratio  botwocn  the  two  variances  (F)  exceeds 
tho  value  for  F  indicated  in  tho  body  of  the  tablo  for  the  5$  level, 
there  are  fewer  than  5  chances  in  100  that  the  disparity  between  the 
calculated  variances  is  duo  to  chance;  if  F  exceeds  that  recorded  for 
the  Yjo   level,  tho  probability  is  less  than  1  in  100  that  tho  difforonce 
is  accidental."  (Arkin  and  Colton,  1959). 
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Taken  from  (Arkin  and  Colton  1959). 
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Table  A-7c.   Percentiles  of  the  F(v\,  v*\   Distribution   with   Decrees 
of  Freedom  v,  for  the  Numerator  and  i>»  for  the   Denominator 
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Taken  from  (Dixon  and  Massoy,    1957) 
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